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Abstract 


A method for efficiently coding natural images using a vector- 
quantized variable-blocksized transform source coder is presented. The 
method, Mixture Block Coding (MBC), incorporates variable-rate coding 
by using a mixture of discrete cosine transform (DCT) source coders. 
Which coders are selected to code any given image region is made through 
a threshold, driven distorti-. n criterion. In this paper, MBC is used in two 
different cat ions. 'The base method is concerned with s:ngle-pai>s low- 
r ( i iV>1 l -.a;«:i[C'.>.i'r n. The «cc nd is a natural extension of the 
base method which allows for low-rate progressive transmission (FT;. 
Since the bewe method adapts easily to progressive coding, it offers the 
aesthetic advantages of progressive co-ding without incorporating excessive 
channel overhead." Image compression rates of approximately 0.5 bii/pcl 
are demonstrate \ for both munochrome and color imago. 


1 Introduction 
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of th'* dificrei :t 1 koo j.s iz -'-s, variable-rate coding is achieved. When trying 
to maximize the qualitv uf the reconstructed image, Writer usage of coding 
bits cam be attained with the use of vector quantization. The goal is to de- 
scribe a method for ceding images using both vector quantization and 
multiple-block: ire transform coding. A block- threshold technique is used 
to select the blocksize us-'d to cixie any particular region of the image. 

poth vector quantization and transform coding are block coding 
methods. Block coding methods are %ery useful when designing low-rate 
linage c c • t i ’ ' ' r - ■ s s i i : n sterns, arid are used almost cxclus^ely when c c*d i l, g 

at low rates l < I bit/pel). Traditional methods presented in the literature 
use either of there techniques, but have rarely use both together until very 
recently. One of the earlier publication to do this appeared in 19S4 [1]. 


When using traditional vector quantizers for coding images, small 
blocksizcs are used to limit the size of the required quantizer codcboo*. 
But, when coding natural images with a very small number of bits it is 
desirable to use as large a blocksize as possible to take maximum 
advantage of the high inter-pixel correlations. In general, this blocksize is 
usually larger than vector quantization techniques can comfortably^ handle. 
One method to overcome this problem incorporates subsampled sector 
quantization [2], but little has been done to use traditional vector quanti- 
zation with variable-rate coding. 


Small codebooks are needed because the best performing vector 
quantization techniques (i.e., the LBG method [3]) are clustering techni- 
ques whose codebooks are very unstructured. As a result, the codebooks 
are difficult to construct and use. If a codebook is designed to be less 
computational! v intensive, such as with lattice quantizers [4], or pw^mid 
vector quantizers [5], the attainable distortion per codeword increases for a 
given coding rate. 


Transform coding techniques easily allow for the use of large block- 
sizcs so high data compression ratios can be attained. But. it is difficult 
to keep good high-detail resolution when using large transform blocksizes 
[G]. This is true since most methods code only the low-frcqucncy high- 
energy transform coefficients (7j. As a result, the high-frequency coef- 
ficients are often ignored. Since these coefficients carrying most of the 
information about the image’s finer detail image quality suffers. Even 
when the image coefficients are vector quantized so more coefficients can 
be coded for a given average rate, it can still be difficult to get good high- 
detail resolution. 

Bv using more than one blocksize, some of the inherent problems 
associated with low-rate transform coding car: be overcome. Especially 
when vector quantizers are used to code tne transform coefficients. The 
vector quantizer codebooks used here are of low dimensionality to keep 
their implementation simple. This is done by limiting the number of 
coefficients coded within any given block. For the examples given below, 
the vector quantizers code at most three corflicients as a vector for 
monochrome images, and nine coefficients (thf c transform pels) for color 
imapes. Newer more than four transform pels arc coded within any block, 
no matter is *!?e. 

After a block is coded using these r . f Ifi •' i e i j t - ) r m i t c-cl transform 
coders the distortion is measured, to see if it meet.-, a predetermined coding 
threshold. If a block codes poorly, it B divided into four smaller blocks 
and recoded until a distortion threshold is meet c: the minimum blocksize 
is attained. Thus, keeping the overall image integrity high by using the 
smaller blocks to more intensely code the high-detail regions. 

In section 2 the threshold driven MBC coding algorithm is dis- 
cussed. Also, the required overhead sent, to the receiver to describe the 
final block structure of the coder is presented. Section 3 is a presentation 
of the MBC progressive transmission (MBC/PT) modification. In sections 
4 and 5 the transform coder and the vector quantizers used in the example 
are shown. And finally, several examples are presented. 

2 The Threshold Driven Structure of MBC 

As mentioned above, each block of the image is coded using only a 
small number of transform coefficients. The difference between the 
original image and the coded image block is measured, and if the 
difference does not fall below a predetermined threshold, the block is 
divided into four smaller blocks and recoded. A new threshold is then 
applied to sec if any of these blocks need to be divided further. This 
divide-and-test algorithm is continued until tnc entire image is coded with 
distortion that is less than the block threshold levels or the smallest 
blocksize is reached. 

The monochrome images are coded using the maximum absolute 
difference distortion measure. 

d = max l |x t -y l -J 

where the range of i is taken over the image block being coded, and y, is 
the coded value of pixel x,. For color images the maximum mean square 

difference is used, . 

d = max, J(x l -y,) T (x, -y,)/3 
where y, is the coded value of color pel x ( . 


'This work wes supported by the NASA Goddard Space Flight Center 
under grant NAG 5-9 1G. 


912 


PRECEDING PAGE BLANK NOT FILMED 


To demonstrate the MBC method consider the coding of an 
example image segment. In the following it is assumed the image is coded 
with a starting blocksize of 10x16 and a smallest blocksize of 2x2. 

First, a 16x16 block is coded, using the DCF method described 
below, and the distortion t c % e ! for the block is measured. If this distortion 
is greater than the predetermined maximum level for 1 6 x 1 G blocks, 
d min (16x16 }, the block is divided into four $xS blocks for additional 
coding. After each of the SxS blocks is coded their resulting distortion 
levels are compared with the SxS distortion level. dmin(8x8). This 
process is continued until the only image blocks not meeting their given 
distortion threshold are those of size 2x2. Since 2x2 is the smallest 
allowed blocksize. these blocks are transmitted de facto, making no further 
attempt to improve their distortion level. 

Each 16x16 block can be completely coded, using all blccksizes, 
before moving onto the next 16x16 block or all the 16x16 blocks of the 
entire image can be coded before moving to the 8x3 blocks. For MBC, 
this sequence is immaterial. The later method allows one to develop the 
nmeresM ve ; ech mu ut; introduced in the next section. 
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times tiiis ma 
where b leeks 
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Likewise, 
threshold 
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reception of images is nonaesthetic, the need to update the image on a 
framc-by-frame basis (or progressive basis) has arisen. In general, most 
methods found in the literature arc concerned with perfect reconstruction 
of the image. The image is transmitted on a continually improving basis 
until it is completely transmitted so that it can be reconstructed without 
error. This requires the transmission ‘of far more data than is needed to 
attain a visually pleasing reconstruction of the image (as is the case 
considered here).* But, much of this literature is directly applicable to the 
low-rate transmission of images since almost every FT method 
reconstructs a visually acceptable field within a limited number of passes 
[eg-, 12]. 

In the previous section a single an example 16x16 block was coded 
bv passing through all of the necessary blocksize* before moving on to the 
next 16x16 block. But, if the entire image is passed through for each 
blocksize and the difference image is save for additional coding as is 
necessary in the next pass, the MBC method can be used as a PT coder. If 
each pass is immediately transmitted, the receiver can be reconstructing a 
crude representation of the image using these larger blocksize coefficients 
while the coder is processing the next pass. All passes after the first need 
only code the image residuals. The residuals coding information received 
in subsequent passes is added to the “already waiting image of the 
receiver. The image is updated using smaller blocks so it acquires more 
clarity with each pass. Since these blocks are of smaller size, each pass 
updates higher-frequency image components than were coded in the 


previous pass 


Since the first 


passes are coding the diffe 
ct'rinus problem in updatm< 


Is ceded with very few bits the receiver has an 
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c cions of the image which need additional coding 
>r resources. This means the regions ot the image 
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able for d 


bL<ks, 0 bits 


-«::6 block is divided into 6x8 blocks, line next 
. die s-cond 8x5 block is divided. The last four bits 
,h;-d z : block is divided into 2:<2 blocks. Notice 
' -:x Miuie are placed with the bit maps go nor a ted at 
a; b io-eded to code then. 

. . . n 1 ... : , : ,f ;,ol? and d, is the number of bits used to 
: • ■ d.-.u the average coding rate for the blocks of 
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average coding rate for the entire MBC system is 
]-r — VI . p .r,. and T/p t = l. 
cod-d with i-th pass blocksize. 

-,o force the coding method of one blocksize 
e. as is the case for the examples of this paper. So me- 
rriment a! or. in fact, be impossible. Consider the case 
! 6 x 1 6 are code with, say, six DCT coefficients. The 
oudd ceded in a similar fashion but, of course, it is 
code the a 2x2 blocks with six transform coefficients. 
2 x y blocks must be coded with a different method. 
t .,j ft bove. there is no reason to keep the distortion 
p j 5 even possible to change the distortion 
And. since the different blocksizes encompass 
m mo razees it may be desirable to do this. 

j progressive Transmission MBC 

transmission has grown out of need to transmit images 
bandwidth is dramatically smaller titan what is avail- 
reconstruction of full-field imagery. Since slow-scan 


The high detail regions of an image are coded more than once 
vhert using the MBC7PT method. Not only do the high detail regions 
quire a greater channel capacity io transmit their coefficients because 
nailer blocksizes are being used but. it they also require channel resources 
cr;C h of the previous passes. The rate for a MB C/FT coder is calculated 
dug (la), but the pass fractions are no larger constrained to add to one. 

’ fr ' cl - /.)> 

and p,>p. for t~; 

bus, applying this to (la) shows that is it possible^ to have 
, or , r .>Ru 3C . But this can be offset by the fact that MBC/FT may 
m verge to the original image more quickly and require fewer blocks since 
busy sections of the image are coded with information that is taken 
c.m one than a single pass. As is shown in section 5. an MBC/FT image 
m require fewer coding bits to transmit than image of similar quality 


K r/SC/PT 

converge to the oriei 
the bupv sections of 


ng the MBC method. 


to transmit tr 


4 The Transform Coder 


If a large block docs not adequately code a given image region, it 
is divided into smaller blocks and recoded. So there is no strict advantage 
ill using a large number of coefficients so code any particular blocksize. In 
fact, there is a tradeoff between expending more efTort coding the larger 
blocks so fewer smaller blocks are used, and coding the larger blocks 
minimally so to let the threshold algorithm assign more smaller blocks tor 
coding. 

For the examples of this paper it was chosen to code each block 
with only four DCT transform coefficients, including the dc and three 
lowest order frequency coefficients (Figure A). This was done, not so much 
to attain the best overall coding rate. but. to strike a median between PT 
coders which code a minimum of information about a given block [8] and 
those which code a large amount of information per block [9], This accom- 
plishes two things. Firstly, it shows an image can be adequately coded in 
a relatively small number of passes (four for the example* here) using a 
small number of transform coefficients at each pass, and secondly, it shows 
liiat this can be done using a simple coding algorithm for each pass. In 
addition, when using the same transform coder for each pass it is also 
possible to use share the same vector quantizer between all of the passes. 
This saves quantizer design effort. 
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5 The Quantizers 

The following four paragraphs describe the quantizers used to code 
the examples discussed in section 5 of this paper. The remaining para- 
graphs discuss the details of LBG quantizers used. 

When the MBC method was used t » code monochrome image*, 
the dc transform coefficients were coded with an 8-bit linear scalar 
quantizer (LSQ), and the ac transform coefficients were coded as a single 
3-dimensional vector using an LBG vector quantizer whose codcbook was 
of size 256. 

When the MBC/PT method was used to code monochrome 
images, the quantizers for the first pass were different than those cf 
subsequent passes. The these later passes code difference images that have 
nearlv zero means blocks, while the first pass deals with the original image 
which is not zero mean. The dc coefficients for the first pass were coded 
with an 8- bit LSQ. In subsequent passes, the dc coefficients were ceded 
with a 5-bit optimal Laplacian scalar quantizer (OLSQ) [1C]. The non-cc 
coefficients were coded the same for all passes using an LBG vecto, 
quantizer whose codcbook contained 256 vectors. 

The three non-dc coefficients of the YIQ images, when using the 
method MBC, were quantized with a vector quantizer -whose code Look 
contained 1024 vectors. The dc Y-componcnts were quantized with an 6- 
bit LSQ and the dc I- and Q- components were coded with a a- bit OLSQ. 

As with the monochrome MBC/PT method, the YIQ MBC/PT 
wiiere coded differently in the first pass than in the subsequent 
As was the case f- r MBC, the dc Y-compor.ent was quantized with 
\t [ sn wr-il.' the I- and Q-compencnls were quantized with a 5- bit 
Ia tui.'jequent passes, the dc coefficient were quantized with an 
quantize: whose codebook contained 04 vector.,. 1 tic non-dr 

v cc t o r qu a n t i z e : , w j i o se 
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farters that map 
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implement 
acquired f:< 
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this cast, the rate is .358 bits/pel, and the percentage of blocks coded with 
4x4 blocks is 25.25 percent. 

To code the monochrome image with MBC/PT only requires an 
..vira Oil bits/pel over MBC. The overhead needed to code any given 
2, age is a function of the LBG codcbook, and the codebook is a function 
,.r -he training set used. A differently constructed codcbook could ofTer 
'•Yv-rent re*uhs. It is interesting to note that the MQ MBC/PT method 
requires less coding rale to obtain the same image quality (PSNR) as is 
obtained when using MBC alone. It is clear that MBC/PT does not 
require excessive overhead to add the desirable PT feature. 
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construct a “simple to implement vector-quantized 
adaptive transform source coder. 

6 Results 

All of the examples, as listed in Tables 3-6, use the ol2xol2 RGB 
woman /hat picture of the UCLA database. The monochrome examp.es 
use the green (G) color field, while the YIQ images are made u»ing the 
RGB to YIQ transformation matrix of [12]. All the examples use a 
starting blocksize of 16x16 and a final blocksize of 2x2. These tab.es list 
the number of blocks coded for <ach blocksize, and the thresholds used to 
test the quality of the coding passes. Also, the MBC/PT tables list the 
average coding rate that has accumulated after each pass. 

This rate is based upon the average cf the coding bits as spread 
across the entire image, without concern for what fraction of the image is 
coded within anv particular pass. These rates represent the coding rate 
that is required 'to code the image if the coder where to stop with that 
particular pass. Since the remaining passes are yet to be coded, the image 
percentage coded within the indicated pass must be update^ to include the 
image percentages coded in all of subsequent passes. exam p 

consider the MBC/PT rate of Table 4 when stopping at 4x4 blocxs. In 


Table 1. 
scale factors for 

monochrome i 

Blocksize scale factor 

16x16 

60 

8x8 

35 

4x4 

20 

2x2 

10 


Table 2. 

Vector quantizer scale factors for YIQ images. 


Blocksize 

16x16 

8x8 

4x4 

2x2 


scale factor 
80 
35 
20 
10 
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Table 3. 

Monchrome MBC rale (bits/pcl) 


blocksize ^blocks 

% image 

PS NR 

rate 

ihrcshol' 

6 x 16 53-3 

52.05 

23.11 dB 

- 

33 

8x8 10-13 

25.59 

26.22 

- 

33 

4x4 2950 

18.01 

29.61 

- 

33 

2x2 2556 

4.36 

31.01 

.479 



Table 4. 

Monchrome MBC/PT rate (bits/pcl) 


min blocksize ^blocks 

% image 

PSNR 

rate 

threshold 

16x10 1024 

100.00 

23.13 dB 

.063 

33 

8x5 1940 

47.48 

26.-12 

.170 

33 

4x4 3520 

21.45 

29.76 

.358 

33 

2x2 2472 

3.77 

30.94 

.490 

- 



Table 

5. 


YIQ MBC rate 

(bits/pcl) 

min blocks!. 1 

:e 2£ blocks 9c image 

PSNR rate threshold 

i 6 v 1 6 

* 590 57.62 

24.09 dB - 12 

■5 x 5 

976 23. ; 3 

26.77 - 12 

4x4 

24 M 14-92 

29.06 - H 

•7 c •> 

2 e 4 .CM 

29.69 .691 




Table 6. 



YIQ MBC/PT rate 1. bits/pel) 



Tim;: go FSNR rate tkresho 


v 1 5 

1024 Hjj.i'9 24.12 dB .150 12 


- X 1 

17r0 42.97 26,89 .220 12 

: : i 17.52 29.C9 .402 1 1 



;*•_.) 2.78 29.63 .510 



Figure 3. Example Tree Structure 



Figure 4. lowest order DCT coefficients 






